Generalized Analysis of a Distribution Separation Method
نویسندگان
چکیده
Separating two probability distributions from a mixture model that is made up of the combinations of the two is essential to a wide range of applications. For example, in information retrieval (IR), there often exists a mixture distribution consisting of a relevance distribution that we need to estimate and an irrelevance distribution that we hope to get rid of. Recently, a distribution separation method (DSM) was proposed to approximate the relevance distribution, by separating a seed irrelevance distribution from the mixture distribution. It was successfully applied to an IR task, namely pseudo-relevance feedback (PRF), where the query expansion model is often a mixture term distribution. Although initially developed in the context of IR, DSM is indeed a general mathematical formulation for probability distribution separation. Thus, it is important to further generalize its basic analysis and to explore its connections to other related methods. In this article, we first extend DSM’s theoretical analysis, which was originally based on the Pearson correlation coefficient, to entropy-related measures, including the KL-divergence (Kullback–Leibler divergence), the symmetrized KL-divergence and the JS-divergence (Jensen–Shannon divergence). Second, we investigate the distribution separation idea in a well-known method, namely the mixture model feedback (MMF) approach. We prove that MMF also complies with the linear combination assumption, and then, DSM’s linear separation algorithm can largely simplify the EM algorithm in MMF. These theoretical analyses, as well as further empirical evaluation results demonstrate the advantages of our DSM approach.
منابع مشابه
Frequency Analysis of FG Sandwich Rectangular Plates with a Four-Parameter Power-Law Distribution
An accurate solution procedure based on the three-dimensional elasticity theory for the free vibration analysis of Functionally Graded Sandwich (FGS) plates is presented. Since no assumptions on stresses and displacements have been employed, it can be applied to the free vibration analysis of plates with arbitrary thickness. The two-constituent FGS plate consists of ceramic and metal graded thr...
متن کاملBuckling and Thermomechanical Vibration Analysis of a Cylindrical Sandwich Panel with an Elastic Core Using Generalized Differential Quadrature Method
In this paper, the vibrational and buckling analysis of a cylindrical sandwich panel with an elastic core under thermo-mechanical loadings is investigated. The modeled cylindrical sandwich panel as well as its equations of motions and boundary conditions is derived by Hamilton’s principle and the first-order shear deformation theory (FSDT). For the first time in the present study, various bound...
متن کاملParameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation
Spatial generalized linear mixed models are used commonly for modelling non-Gaussian discrete spatial responses. We present an algorithm for parameter estimation of the models using Laplace approximation of likelihood function. In these models, the spatial correlation structure of data is carried out by random effects or latent variables. In most spatial analysis, it is assumed that rando...
متن کاملOn generalized topological molecular lattices
In this paper, we introduce the concept of the generalized topological molecular lattices as a generalization of Wang's topological molecular lattices, topological spaces, fuzzy topological spaces, L-fuzzy topological spaces and soft topological spaces. Topological molecular lattices were defined by closed elements, but in this new structure we present the concept of the open elements and defi...
متن کاملBlind Deconvolution of Sources in Fourier Space Based on Generalized Laplace Distribution
An approach to multi-channel blind deconvolution is developed, which uses an adaptive filter that performs blind source separation in the Fourier space. The approach keeps (during the learning process) the same permutation and provides appropriate scaling of components for all frequency bins in the frequency space. Experiments indicate that Generalized Laplace Distribution can be used effective...
متن کاملThe Type I Generalized Half Logistic Distribution
In this paper, we considered the half logistic model and derived a probability density function that generalized it. The cumulative distribution function, the $n^{th}$ moment, the median, the mode and the 100$k$-percentage points of the generalized distribution were established. Estimation of the parameters of the distribution through maximum likelihood method was accomplished with the aid of c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Entropy
دوره 18 شماره
صفحات -
تاریخ انتشار 2016